Annotating a Japanese Text Corpus with Predicate-Argument and Coreference Relations
Abstract
In this paper, we discuss how to annotate coreference and predicate-argument relations in Japanese written text. Earlier efforts to build Japanese text corpora annotated with these relations include the Kyoto Text Corpus version 4.0 (Kawahara et al., 2002) and the GDA-Tagged Corpus (Hasida, 2005). However, there is still much room for refining their specifications. We therefore discuss issues in annotating these two relations and propose a new specification for each. Following these specifications, we built a large-scale annotated corpus and examined its reliability. As a result of this work, we have released an annotated corpus named the NAIST Text Corpus, which is used as the evaluation data set for the coreference and zero-anaphora resolution tasks in Iida et al. (2005) and Iida et al. (2006).
Similar resources
Corpus based coreference resolution for Farsi text
"Coreference resolution", or finding all expressions that refer to the same entity in a text, is one of the important requirements in natural language processing. Two words are coreferent when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...
Flow Graph Corpus from Recipe Texts
Academic Center for Computing and Media Studies, Kyoto University; Graduate School of Informatics, Kyoto University; Yoshidahonmachi, Sakyo-ku, Kyoto, Japan. Abstract: In this paper, we present our attempt at annotating procedural texts with a flow graph as a representation of underst...
Predicate-argument Structure based Textual Entailment Recognition System of KYOTO Team for NTCIR-10 RITE-2
We participated in the Japanese tasks of RITE-2 in NTCIR-10 (team id: “KYOTO”). Our proposed method regards predicate-argument structure as the basic unit for handling the meaning of a text or hypothesis, and performs matching between the text and the hypothesis. Our system first performs predicate-argument structure analysis on both a text and a hypothesis. Then, we perform the matching between text and hypoth...
Joint Case Argument Identification for Japanese Predicate Argument Structure Analysis
Existing methods for Japanese predicate argument structure (PAS) analysis identify case arguments of each predicate without considering interactions between the target PAS and others in a sentence. However, the argument structures of the predicates in a sentence are semantically related to each other. This paper proposes new methods for Japanese PAS analysis to jointly identify case arguments o...
Toward Text Understanding: Integrating Relevance-tagged Corpus and Automatically Constructed Case Frames
This paper proposes a wide-range anaphora resolution system toward text understanding. This system resolves zero, direct and indirect anaphors in Japanese texts by integrating two sorts of linguistic resources: a hand-annotated corpus with various relations and automatically constructed case frames. The corpus has relevance tags which consist of predicate-argument relations, relations between n...
Publication date: 2007